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Abstract 

In this paper we introduce a new random mapping model, T^, 
which maps the set {1,2, ...,n} into itself. The random mapping 
is constructed using a collection of exchangeable random variables 
D\, D n which satisfy Y^i=x^i = n - ^ n the random digraph, G® , 
which represents the mapping T®, the in-degree sequence for the ver- 
tices is given by the variables Di,L>2, ...,D n , and, in some sense, G® 
can be viewed as an analogue of the general independent degree mod- 
els from random graph theory. We show that the distribution of the 
number of cyclic points, the number of components, and the size of a 
typical component can be expressed in terms of expectations of various 
functions of D\,D2, -.,D n . We also consider two special examples of 

which correspond to random mappings with preferential and anti- 
preferential attachment, respectively, and determine, for these exam- 
ples, exact and asymptotic distributions for the statistics mentioned 
above. Results for the distribution of the number of successors and 
predecessors of a typical vertex in G^ in terms of expectations of var- 
ious functions of D\, D2, D n are obtained in a companion paper 
[23]. 
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1 Introduction 



The study of random mapping models was initiated independently by several 
authors in the 1950s (see [8, 18, 19, 25, 31, 40] ) and the properties of these 
models have received much attention in the literature. In particular, these 
models have been useful as models for epidemic processes, and have natural 
applications in cryptology (see, for example, [9, 10, 11, 15, 17, 20, 29, 30, 33, 
34, 37, 38, 42]). To date, the most widely studied models have been special 
cases of a general model denoted by T p(n ), which can be defined as follows: 
Let [n] denote the set of integers {1, 2, n} and let M n denote the set of all 
mappings from [n] into [n]. For each n > 1, let p(n) = {pij(n) : 1 <i,j < n} 
be an array such that Pij{n) > for 1 < i,j < n and ~YTj=iPij{ n ) = 1 f° r 
every 1 < % < n, and let X™, X^ , X™ be independent random variables 
such that Pr{X™ = j} = p>ij(n) for all 1 < i, j < n. Then the random 
mapping T p(n ) : [n] — > [n] is defined (in terms of the variables X™, Xg, X™) 
by 

r p(B) (i)=j iff X? = j (1.1) 

for all 1 < i,j < n. It follows from (1.1) that the distribution of T p(n ) is 
given by 

n 

Pr{T p(n) = f} = l[p tm (n) (1.2) 

i=i 

for each / G M. n - Any mapping / G M n can be represented as a directed 
graph G(f) on a set of vertices labelled 1, 2, n, such that there is a directed 
edge from vertex i to vertex j in G(f) if and only if f(i) = j. So G p ( n ) = 
G(Tp( n )) is a random directed graph on a set of vertices labelled 1,2, ...,n 
which represents the action of the random mapping T p ( n ) on [n\. We note 
that since each vertex in C p ( n ) has out-degree 1, the components of G p ( n ) 
consist of directed cycles with directed trees attached. Also, it follows from 
the definition of T p ( n ) that the variables X™,X%, . . .X™ can be interpreted 
as the independent 'choices' of the vertices 1, 2, . . . , n in the random digraph 
G-p( n ) (see, in addition, Mutafchiev [35] and Jaworski [27]). 

The example of T p ( ra ) which is best understood is the uniform random 
mapping, T n = T p ( n ), where Pij(n) = \ for all 1 < i, j < n . Much is known 
(see for example the monograph by Kolchin [32]) about the component struc- 
ture of the random digraph G n = G(T n ) which represents T n . Aldous [1] has 
shown that the joint distribution of the normalized order statistics for the 
component sizes in G n converges to the Poisson-Dirichlet (1/2) distribution 
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on the simplex V = {{xi} : Yl x i — 1? > x i+i > for every i > 1}. 
Also, if Mk denotes the number of components of size k in G n then the joint 
distribution of (Mi, M2, . . . , Mb) is close, in the sense of total variation, to 
the joint distribution of a sequence of independent Poisson random variables 
when b = o(n/\ogn) (see Arratia et.al. [5], [6]) and from this result one 
obtains a functional central limit theorem for the component sizes (see also 
[21]). The asymptotic distributions of variables such as the number of pre- 
decessors and the number of successors of a vertex in G n are also known (see 
[10, 11, 15, 29, 30, 33, 34, 37]). In another direction, Berg, Jaworski, and 
Mutafchiev (see [12, 27, 28, 29, 30] ) have investigated the structure of G p ( n ) 
when p(n) is given by Pu(n) = q for some < q < 1 and all 1 < i < n, and 
Pij(n) = for all 1 < i, j < n such that i 7^ j. Finally, Aldous, Miermont, 
and Pitman (see [2] and [3]) have recently investigated the asymptotic struc- 
ture of Gp( n ), where p(n) is given by Pijin) = Pj(n) > for all 1 < i, j < n, 
by using an ingenious coding of the mapping T p(n ) as a stochastic process 
on the interval [0, 1]. Their results are closely related to earlier work on the 
relationship between random mappings and random forests (see Pitman [36] 
and references therein). 

The common feature in all the models discussed above is that each ver- 
tex in G p („) 'chooses' the vertex that it is mapped to independently of the 
'choices' made by all other vertices. In this paper we introduce a new random 
mapping model in which the vertex 'choices' are not necessarily independent. 
The definition of the model is motivated, in part, by developments in the 
general theory of random graphs. In recent years models for random graphs 
with a specified degree sequence have received much attention as models for 
complex networks such as the internet. Loosely speaking, such a random 
graph on n labelled vertices can be constructed by starting with a collection 
of i.i.d., non-negative, integer-valued random variables Di, D 2 , D n and 
adding edges, at random, to the graph until each vertex % has degree D; L in 
the constructed random graph. Of interest in such models is the relationship 
between the component structure of the random graph and the distribution 
of the variables Di, D2, D n . In another direction random graph models 
with 'preferential attachment' have been constructed in order to model the 
evolving structure of complex networks. In such models edges are added 
sequentially to the graph and new edges are more likely to be attached to 
vertices that already have relatively high degree in the evolving graph. The 
literature on these new developments in the theory of random graphs is ex- 
tensive, but a good bibliography is provided by Bonato's survey paper [14]. 
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In this paper we consider a random mapping analogue for both the in- 
dependent degree models and the preferential attachment models described 
above. We show that in the case of random mappings these analogues are 
equivalent and we develop a calculus for determining the distributions of var- 
ious important random mapping statistics which is based on the underlying 
distribution of the in-degrees of the vertices in the directed graph which rep- 
resents the random mapping. The paper is organized as follows. In Section 
2 we define carefully define the new random mapping model. In Section 3 we 
derive the calculus for this new model. In Section 4 we define both a random 
mapping model with preferential attachment and with anti-preferential at- 
tachment. We show that both of these models are equivalent, under certain 
distribution assumptions, to the special examples of the general model de- 
fined in Section 2. For these special examples we investigate the distribution 
of the number of cyclic points, the distribution of the number of components, 
the probability of connectedness, and the distribution of the size of a typical 
component. 



2 The model 

In order to define our new random mapping model, we adopt the following 
notation. For n > 1, M. n denotes the set of all mappings / : [n] — > [n], where 
[n] = {l,2,...,n}, and G(f) denotes, as described in the Introduction, the 
directed graph on n labelled vertices which represents the mapping / e Ai n - 
In addition, for 1 < % < n, di(f) denotes the in-degree of vertex i in the 
digraph G(f), and we let d(f) = (d^f), ...,d n (f)). 

Now suppose that D 1 , ....,D n is a collection of exchangeable random va- 
riables such that Y^i=i Di = n, then we construct a probability measure 
on M n as follows. For / G M n) define 

P?{/} = n ? =l( y /))! Pr{A = *(/), 1 < i < n). (2.1) 

It is clear from the definition of that for any f,g£ Ai n such that 
d(f) = d(g), we have P®{f} = Pnio}- Given the probability measure 
P^, we can define the random mapping which takes values in M. n and 
has distribution given by 

= /} = Pf{/} (2.2) 
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for every f E M n , and we let = G(T®) denote the random digraph on 
n labelled vertices which represents T® . 

An important class of examples can be constructed as follows. Let D\, 
D2, ■ ■ ■ , D n be i.i.d. non- negative integer- valued random variables. It does 
not make sense to construct directly a random mapping digraph with in- 
degrees given by Di, D 2 , D n since the sum of the vertex in-degrees in a 
random mapping digraph on n vertices always equals n. So, instead, we 
let Di, D 2 , D n be a sequence of random variables such that their joint 
distribution is given by 

n 

Pr{ Di = 1 < % < n) = Prj D { = d h 1 < % < n | £)< = n} 

i=i 

and we use Di, t>2, D n to construct and G%. This gives us the natural 
analogue of the general i.i.d. degree model discussed in the Introduction. We 
also remark that it is easy to check that if Di, D 2 , D n are i.i.d. Poisson{\) 
variables, then the corresponding random mapping T® is just the usual uni- 
form random mapping. We will see in Section 4 below that there are inter- 
esting interpretations in the cases where the underlying i.i.d. variables 
Di,D 2 , ---jDn have (i) a generalised negative binomial distribution, and (ii) 
a Bin(m, 1/m) distribution for some m > 2. 



3 Results 

In this section we develop a calculus, in terms of the variables Di, D 2 , D n , 
for determining the distributions of various random variables associated with 
the structure of G® . The first variable we consider is the number of cyclic 
vertices in the random digraph G„. 

A vertex i G [n] is a cyclic vertex for the mapping / G Ai n (and for the 
corresponding digraph G(f) if there is some k > 1 such that f k \i) = i, 
where is the k th iterate of the function /. We define X n (f) to be the 
number of cyclic vertices of / G M. n and we let X® = X n (T^) denote the 
number of cyclic vertices in G%- Then we have 

Theorem 1. For 1 < k < n — 1 

Pr{X,f = k} = — ^—E{(bi - 1)1)^2 ■■■D k ) 
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and 

¥i{X° = n}= Pr{A = 1, 1 < i < n}. 

Proof, (of Theorem 1) We begin by considering the case 1 < k < n — 1. For 
/ G M. n , let C n (f) denote the set of cyclic vertices for the mapping /, and 
let £® = C n (T°). Then we have 

Pr{Xf = k}= Pr ^ = L}. 

LC[n]s.t.\L\=k 

We fix L — {1, 2, k} = [k] and observe that 

Pr{/£ = [*]} = 
Prj^J = [k] Di = di,l <i < n|Pr{A = d h 1 < % < n). 

s.t. Yl d^ = n 

Now fix d = (d±, c?2, <^n) such that J^ILi di = n and let 

A4 n (d) = {/ e M n : = di, 1 < ^ < n} . 

Then it follows from (2.1) and (2.2) that 

\{f E M n (d) : £ n (f) = [k]}\ 



Pr £f 



D,- = d;A < i < n 



-i 



In order to count the set {/ G A1 n (d) : C n (f) = [A;]}, we first construct a 
bijection between the set, 5^, of sequences x = (xi,x 2 , ■■■,x n ) such that for 
any x G and 1 < i < n, we have \{m : x m = i}| — di and the set of 

— * 

mappings M n (d). 

The bijection is defined in terms of an algorithm which, for any sequence 
x = (xi,X2, x n ) G Sr, constructs, after a series of 'rounds', a corresponding 
mapping / x G A4(d). Informally, the algorithm works as follows. In the 
first round the vertices i G [n] with di — are mapped (in a way which is 
determined by the sequence x) to vertices in [n] which have in-degree greater 
than 0. After the first round the 'availability' of some vertices to receive 
directed edges in the corresponding directed graph will have been reduced 
because the algorithm has mapped some vertices to them. A vertex i becomes 
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'unavailable' as soon as it is 'full', i.e. di vertices have been mapped to it. So, 
at the beginning of the second round some vertices which were 'available' in 
the first round may be 'unavailable' at the start of the second round. These 
new 'unavailable' vertices are mapped to 'available' vertices in the second 
round and this construction continues until the mapping / x is completely 
defined. 

In order to describe rigorously the algorithm which constructs / x , we 
introduce the following notation. For any vertex % G [n], any m > 1, and 
any x G Sj, we let a,i(m, x) denote the 'availability' of vertex i at the start of 
round m. Initially we set Oj(l,x) = di for all i G [n\. The values of Oj(m, x) 
when m > 2 are determined recursively by the algorithm. For x G Sj and 
m > 1 we define the sets 

3> m (x) ={i£ [n] : a t (m, x) = 0}, ^(x) = 0, Z m (x) = y m (x) \ y m _i(x). 

Finally, for x G <S^, let ?/o( x ) — and for m > 1, let y m (x) = |3^ m (x)| and 
z m (x) = |Z m (x)|. 

The algorithm: Given x e 5 d -, the corresponding mapping / x G M n (d) is 
constructed as follows. 
Step 1. Set m = l. 

Step 2. If Z m (x) ^ 0, then 

• list the elements of Z m (x) in increasing order: %\ < i 2 < ... < i Zm U)- 
For 1 < k < z m (x), define / x (ifc) = x y m -i(x)+k, where x t is the t th term 
in the sequence x = (x±, x 2 , x n ). 

• Next, for each j G [n], set 

aj(m+ l,x) = aj(m,x) - \{i G 2 m (x) : / x (i) = j}|. 

• Go to Step 3. 

Otherwise, if Z m (^) = (i.e.^ m _i(x) = ^ m (x)), then 

• list the set [n] \ 3^m( x ) i n increasing order: %\ < i 2 < ... < i n -y m M- 
For 1 < k < n — y m (x), let f x (ik) = x y rn (^)+k- This completes the 
construction of the mapping / x . 
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Step 3. Set to = m + 1 and go to Step 2. 

We mention here that we have recently learned that Blitzstein and Diaconis 
[13] have used an algorithm to construct labelled trees with a given degree 
sequence which is similar in spirit to the algorithm described above. It is 
clear from the definition of the algorithm, that if x, x' G and x ^ x', 
then / x 7^ / x /. In particular, if fc(x, x') = min{/c G [n] : x k ^ x' k } then 
there is some i G [n] such that f x (i) = #fc(x,x') an d /x'(0 = ^Hxx'V Since 
\Sj\ = di " [ d j = |.M n (d) |, it follows that the algorithm constructs a bijection 

between Sj and .M n (ci). Hence, to count {/ G M n (d) : £ n (/) — [&]} it 
suffices to count {x G : C n (f x ) = [k]}. To this end, we prove the following 
lemmas. 

Lemma 1. For each x G Sj, x = (x±, x 2 , x n ) we define t(x) as follows: 
t(x) = min {t : \{x t ,x t+1 , ...,x n } \ = n - t + l} . 

Then 

£-n{fx) = (^t(x), ^t(x)+lj •••) x n\- 

Proof. ( of Lemma 1 ) Suppose that x G and suppose that the algorithm 
terminates in round to, then the first step is to show that 

£n(/x) = N \ ^rh(x) . (3.1) 

We begin with a few simple observations. First, it follows from the description 
of the algorithm that at the beginning of the m th round, where 1 < to < to, 
the vertices in [n] \ 34i-i( x ) have not yet been assigned their image under 
the mapping / x . Also, for every i G [n], we have 

aj(m,x) = | {A; : x fc = i,y m _i(x) + 1 < k < n}\. (3.2) 

In other words, Oj(to, x) = r if and only if i appears r times in the sequence 
( x 2/m-i(x)+i> ^j/ m _i(x)+2, •••,^n)- In addition, it follows from (3.2) and from the 
definition of y m (x.), that 

™ - 2/m-l(x) = ^ Oj(TO, X) = ^ Oj(to, x) 
i6 [n] «6[n]\y m (x) 
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since Oj(m, x) = for every i G ^m(x). Now if m — rh, then ^m(x) = 
^m-i(x) and so 

n - ym(x) = n - y^_i(x) = a;(m,x). 

ie[n]\y A (x) 

This implies immediately that 

ai(m, x) = 1 for all i G [n] \ }m(x). (3.3) 

Hence the last step of the algorithm generates a permutation on the set 
N \ ^m(x) i.e., 

{Wi'-' 1 ^ = ^ \ £ A»(/x) • (3.4) 

Suppose now that i G 3-m( x ) (i-e., i G [n] and Oj(m, x) = for some 1 < m < 
rh — 1). Then, since 

3Mx) c 3Mx) c ... C ^_!(x) = y A (x), 

we can define 

TOj = min{m : Oj(m, x) = 0}. 

If mi = 1, then aj(l,x) = di = 0, and we must have i ^ £ n (/x) since 
vertex i has in-degree in G„(/ x ). So, suppose that 2 < m; < rh — 1 (and 
= Oj(l,x) > 0). It follows from the description of the algorithm and the 
definition of m^, that at the start of round vertex % is 'unavailable' and 
/~ 1 (i) = {j : /x(j) = i} Q ymi-i- Since the value of / x (z) is determined 
during round mj and since aj(mj,x) = 0, it follows that / x (i) 7^ «■ It is 
also clear from the description of the algorithm that for any j G [n] \ y mi -i, 
we have / x (j) 4- iVmj-i since, for every fc G X^-i an d for every m > 
a,k(m, x) = (i.e., vertex fc G 3^-1 is 'unavailable' in every round m > mi). 
It follows by induction that 

for every £ > 1. Now suppose that i G C n (f x ), and in particular, that 
= i for some f > 2, then ff~ l \i) G /^(i) C ^ mi _L But this 
contradicts (3.5), so we must have % £„(/ x ). Therefore 

^rn(x) C [n]\C n {f x ). 
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This together with (3.4) implies (3.1). 

Next, we note that (3.3), (3.4), and the definition of t(x), imply 



{% H+ l.-,^n} = [n] \ ^rn(x) C {x t(x ) X n }. 

Thus to show that 

[n] \ ^m(x) = {x t(x ), ....,x n }, (3.6) 

it suffices to show that 

X Vm(x) ^ {^(x)) ••••) x n} ■ 

Suppose that x Vrh(x) G {x t ( x ), x n }. Then in the m — 1 st round there is 
some f G Zm-i such that f is assigned to x yrh(x) . So at the beginning of 
the rh — 1 st round of the algorithm x v . , , was available, but it is unavailable 
at the start of the rh th round (since x yih , } ^ [n] \ y^). Since the rh — 1 th 
round of the algorithm cannot generate new unavailable vertices, we have 
a contradiction. Hence (3.6) holds and Lemma 1 follows immediately from 
(3.1) . □ 

It follows from Lemma 1 that 

{x G <%: £„(/x) = [k]} = {x G <%: {x n _ fc+ i, ...,ic„} = [A;] and x n _ fc G [k]}. 
So routine counting arguments yield 



Pr = [fc] 



L>i = di,l < i < n 



|{x G «S d -: {xn-k+i, ...,x n } = [k] and x n _ k G [fcjjj x [ILi d % 
= (^j ^2 dl ^ 2 ' ' ' di-i(di - l)did i+1 ■ ■ ■ d k 



i=l 

for 1 < k < n — 1. Hence 



= [*]} = Q " (^) E#i-(A- 1) A • • • Dk) 
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(^"^((^-UAiV-i*) 



since L>i,D 2 , ■■■,D k are exchangeable. Repeating the argument given above 
and using the exchangeability of the variables D ± , D n , we also obtain 

Pr{C = L}= (») " 1)44 • • • 4) 

for any L C [n] such that |L| = fc. Summing over all such sets L, we obtain 

Pr{Xf = A;} = -^-£((4 - 1)44 • • • 4)- 

Finally, we consider the case k = n. Clearly X% — n if and only if / x is 
a permutation of [n] , and / x is a permutation of [n] if and only if Z)j — 1 for 
all 1 < i < n. Hence 

Pr{xf = n} = Pr{ A = 1, 1 < i < n} 
as required, and the theorem is proved. □ 

Let JVjf denote the number of components in G%- Let cr(m) be a uniform 
random permutation on an m-element set and let denote the number 

of cycles in the random permutation cr(m). Then we have 

Corollary 1. For 1 < £ < n, 



+MMlp r {A = 1, 1 < i < n] , 
where s(- , •) are the Stirling numbers of the first kind. 

Proof. Recall that the mapping restricted to the cyclic vertices of , 
is a permutation of C®. The corollary follows from the observation that the 
number of components in equals the number of cycles in the permutation 
of £° by . So N° = I if and only if the mapping T% restricted to C% is 
a permutation with £ cycles. 
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First suppose that l<£<k<n— 1, and L C [n] such that \L\ = k. 
Also fix the in-degree sequence d — (di, d 2 , d n ) such that Y^i=i = n, and 
such that di > 1 for every i 6 L and dj/ > 2 for some G L. Recall that for 
any /, g G .M n W we have P^(/) = P^(g), so it follows from the bijection 
described in the proof of Theorem 1 that 



Pr 


= « 


£j = L,A = di,l<i<n] 








{x G iSj* : {x n ...,x n } = L, x n k G L, permutation 


/x| 


L ha 


s £ cycles} 








^} 










= Pr{iV CTW = £}. 









Since the set L C [n] and the degree sequence d were arbitrary it follows 
that 

Pr {< = £|xf = A;} = Pr{iV CT(fe) = £}. (3.7) 



In the case X% = n, the mapping is a random permutation of [n] (since 
= [n]), so for 1 < £ < n we also have 



Pr {AT? = = n} = Pr{iV a(n) = £}. 

It follows that 

n 

Pr{< = £} = ^ Pr {Arf = ^ = k] Pr{xf = A;} 



(3.8) 



k=t 



= Y,Pr{K {k) =t}Pr{X° = k}. 



k=i 



Let s(- , •) denote the Stirling numbers of the first kind, then it is well known 
that there are \s(k,l)\ permutations of /c-element set with exactly I cycles, 
i.e., 

\s(k,l)\ 



Pr{N a{k) = £} 
which implies the assertion of the corollary. 



k\ 



□ 
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Let B® denote the event that the random graph is connected. Then 
since = {N^ = 1}, we obtain the following result immediately from the 
proof of Corollary 1: 

Corollary 2. 



k 
fc=i 



n-1 



_ E{(D 1 -l)D l D 2 ---D k ) Pr{A = l,l<^<w} 

^ n — k n 

k=i 

Finally, suppose that £i, £ 2) ••• is a sequence of independent indicator vari- 
ables such that, for k > 1, Pr{^ = 1} = | and such that £1,^2, •••• and X® 
are independent. It is well known (see [16]) that for m > 1 



Nairn) ^^k, (3-9) 



d 

k=l 



and that (N a ^ m ) — log m) / ylogm converges in distribution to the standard 
iV(0, 1) distribution. It is an easy consequence of (3.7)-(3.9), that 

Corollary 3. For n > I, 

(3-10) 



fc=i 



We note that (3.10) is useful for investigating the asymptotic distribution of 



N n 



D 



Next, we consider the distribution of the size of a 'typical' component of 
G%. Let Ci(n) denote the size of the component in which contains the 
vertex 1, then the distribution of Cf (n) is given by the following theorem. 

Theorem 2. For 1 < t < n, let D[, D' 2 , D' e be a sequence of variables with 
joint distribution given by 

1 

Prji^ = di, 1 < % < = PrjA = <U, l<i<i\j2^ i = i }' 

i=i 

then 

Pr{c? (n) = ^ = £ - Pr{Bf } Pr{ £ A = 

i=i 
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Proof. Fix 1 < £ < n and let Cf (n) denote the vertex set of the component 
of which contains the vertex 1. Then we have 



Pr{Cf(n) 



Pr{C?(n) = C}. 



CC[n] s.t. ISC 
and \C\=t 



Now fix C = [£] and observe that if Cf (n) = [£] then rf must map [£] into 
[£], and, in particular, we must have Y^=i A = £■ So 

Pr{Cf (n) = [<]} = Pr{cf (n) = | £ A = A = ( 3 ' U ) 

i=i i=i 

— # 

Next, by summing over all degree sequences d = (c?i, d 2 , <i n ) such that 
X)i=i di = £ and X^=i = n > we obtain 



Pr{cf(n) = [<||X;A 

i=l 



E p r{ c?(n) 



A = d«, 1 



< « < n j 



xPr 



{A = 



d;A<i<n 



££, = «}. 



(3.12) 



Now fix d = (di, d„) such that X)i=i di = £ and J^ILi ^ = n ' anc ^ define 
M(c?i, d 2 , ^) to be the number of mappings / : [£] — > [£] such that = 
for 1 < « < £ and G^(/) is connected. Then we have 



Pr 



{C? (n) = [£] 



Dj — dj, 1 < i < n 



}- 



M(di, £^)(ra — £)\ dj\ ■ ■ ■ d n \ 

~ X ■ 



n 



M(d u ...,di) x 



d e+1 \ ■ ■ ■ d n \ 



n! 



(") ^{Bf |^ = *,1 <*<*}■ (3.13) 
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— * — * 

Now for any degree sequence d — (d 1 , ...,d n ), we define dg = (d 1 , dg) and 
d/ = (dg+i, d n ) (so d = (dg, dg)). Then, by substituting the RHS of (3.13) 
into (3.12), we obtain 

i 

Pr{c?(n) = fl| = 

i=i 

= (ff E E Pr{BT|«-*,l<i<f} 



s.£. s.t. 



xPr{A = <ii, 1 <^<n| 5^A = ^} 

i=i 

(") E Pr{BT|^ = *,l<<<4 Pr { £> * = di ' 1 - < - € } 



") _1 pr{BT}. (3.14) 
Substituting (3.14) into (3.11), we obtain 

Pr{<? = [£]}= (") _1 pr{BT}Pr{ £ A = 
^ ' i=i 

It follows by the same argument and from the exchangeability of the variables 
Di, L>2, D n that, for any C C [n] such that 1 G C and |C| = £, we have 

Pr{c? = = ("VVrjsf }Pr{ £ A = 

^ ' i=i 

So 

p,{cf (B ) = 4 = (;)"p r { e f } Pr { ± a = /} 



i=i 



ipr{6r}Pr{i:A = /} 



n 

4 = 1 

as desired. □ 
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We prove an extension of Theorem 2 in the following special case: Suppose 
that Di,D 2 ,... is a sequence of i.i.d. non-negative integer valued random 
variables. For each n > 1, let D(n) = (A,n, A,n) be a sequence of 
variables with joint distribution given by 

n 

Pr|A,n = di for i = 1,2, ..,nj = PrjA = di, i = 1,2, ...,ra | A = 

i=i 

Let Cf ^ denote the vertex set of the connected component in Gn^ = 
G(Tn^) which contains the vertex labelled 1. For k > 1, we define 
recursively as follows: If [n\ \ (Cf (n) U • • • U cf_ ( ™ } ) ^ 0, let Cf (n) denote the 
vertex set of the connected component in which contains the smallest 

element of [n] \ (Cf {n) U • • • U cf_ ( ™ } ); otherwise, set Cf (n) = 0. For all Jfe > 1, 
let cf (n) = |Cf (n) |, then we have 

Theorem 3. Suppose 1 < k < n and £i, £ 2 , . . . ,£). are su °h that 1 < £i < n , 
1 < £j < n — £ — i\ — ■ ■ ■ — < n, for i = 2, k, and Yli=i ^ — n - Then 
we have 

Pr{C 1 D< " , = £ 1 ,...,Cf"> = 4} = 



x Pr <J 4> =4 1 < « < 

j=ti_i+i 



where to = and ^ = + ... + ^ , % = 1, 2, k. 

Proof. For any integers < m\ < m 2 , let [mi,m 2 ] denote the set of integers 
{mi, mi + 1, m 2 }. Then we have 

Pr{cf ( " ) =£ 1 ,...,Cf W =4} = 



1 )xPr{cf (B) = [t i _ 1 + l,t i ] fori = 1,2, ...,*}. (3.15) 



i=i 
Since 



[Cf (n) = [U-! + 1,4 l<i<fc}c ^ A> = 4 1 < » < k 



j=*i_i+i 
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we have 



Pr {Cf (n) = + 1M fori = 1,2, ...,£;} 



Pr <" Cf {n) = + 1, tj for i = l,2, fc 



j=*i_i+i 



xPr<] ^ Dj, n = ti, l<i<k\ . 

j=*i_i+i 



(3.16) 



Let .4 = {d = : X)*i t ._ 1+1 dj = 4 1 < * < and Y!j=idj = n}, 



then 



+ l,tj] for % = 1,2, ...,k 



j=*i_i+l 



D{n) 



[U-i + l,ti\ for % = 1,2, ...,k 



D jjTl = dj, 1 < j < n 



xPr I D jn = dj, l<j<n 



j=ti_i+i 



1< i < fc 



(3.17) 



Now for any 1 < % < k, let M(d ti _ 1+ i, d< i _ 1 +2, d^) denote the number of 
mappings of + 1, tj\ into + 1, tj\ such that the corresponding directed 
graph on the vertex set + 1, ^} is connected and the in-degree of each 

— * 

vertex + 1 < m < U is d m . Then for any d <E A, we have (as in the proof 
of Theorem 2) 



Pr <{ Cf (n) = + M*] fori = 1,2, 



-q /n-ti-A 1 x jj M(d u _ 1+1 ,. ..,d ti ){d ti _ 1+1 )\ ■ ■ ■ d u \ 



i=i 



i=l 



4! 



fe / , x — 1 fc 



i=l 



i=l 



D jA = d u _ 1+J loil<j<£A. (3.18) 
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The last equality follows by 're-labelling' , for each 1 < i < k, the vertices 
U-i + 1, ...,U by 1,2, ...,£i. 

Next we note that since the variables Di, D 2 , -D n are independent and 
identically distributed, we have for every d G A 



Pr <J D jjn = dj, l<j<n 



j=*i_i+i 



n 

i=i 



Pt{Dj = dj, + 1 < j < ti} Pr{D, = dj, t fc + 1 < j < n} 



x 



Pr {ELt,+i Dj = n- t k } 



K 

JJPr{4- A = d ti _ 1+i , 1< j<€*} 

i=l 



X 



Pr{D 3 - = dj, h + 1 < j < n} 
Pr {ELt,+i D J = n-t k } 



(3-19) 

where we replace the the last factor in the product by 1 if tk — n. Substituting 
(3.18) and (3.19) into (3.17) and summing over all d G A, we obtain 



Pr<£ 



D(n) 



+ 1,U} for % = 1,2, ...,k 



D i,n = £i, 1 < i < k 

j=ti-! + l 



n 



n - ti-i 



npr{<^}. 



-1 k 

x 

i=i \ / j=i 

Finally, substituting (3.20) into (3.16) and then substituting (3.16) into 
(3.15), we obtain the result. 



(3.20) 



□ 



In the general case where the variables Di, D 2 , ■■■,D n are exchangeable 
and Er=i Di = n, an analogue of Theorem 3 can be proved. However, the 
result for the special case of Theorem 3 is easier because we are able to 
exploit the fact that the variables D\, D 2 , .. are i.i.d. to obtain the product 
in (3.19) and hence the product in (3.20). Theorems 1,2, and 3 and their 
corollaries illustrate how the distributions of random mapping statistics for 

can be computed in terms of the variables D 1: D 2 , D n . Similar results 
for local properties of T® such as the number of predecessors and the number 
of successors of a given vertex (or vertices) are obtained in a companion paper 
[23]. 
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4 Examples 

We consider two special examples which correspond, respectively, to a ran- 
dom mapping with 'preferential attachment' and a random mapping with 
'anti-preferential attachment'. 

A Preferential Attachment Model 

In this section we investigate T£ : [n] — > [n], a random mapping with 'pref- 
erential attachment', where p > is a fixed parameter. For 1 < k < n, 



we define T£(k) = X^ n) where X\ p ' n \ X ( 2 p ' n> , I^"' is a sequence of ran- 



dom variables whose distributions depend on the evolution of an urn scheme. 
The distribution of each xjf' n ^ is determined by a (random) n-tuple of non- 
negative weights a(k) = (ai(k), a,2(k), a n (k)) where, for 1 < j < n, a,j(k) 
is the 'weight' of the j th urn at the start of the k th round of the urn scheme. 
Specifically, given a(k) — a — (a±, a n ), we define 



The random weight vectors a(l), a(2), a(n) associated with the urn scheme 
are determined recursively. For k — 1, we set ai(l) = a 2 (l) = • • • = a n (l) = 
p > 0. For k > 1, a(k) depends on both a(k — 1) and the value of as 
follows: Given that xjfj!^ = j, we set aj(k) — aj(k — 1) + 1 and for all other 
% 7^ j, we set aj(fc) = aj(/c — 1) (i.e. if X^^ = j then a 'ball, with weight 1 
is added to the j th urn). 

The random mapping T£ as defined above is a preferential attachment 
model in the following sense. Since, for 1 < k < n, Tfi(k) = xjf' n \ and 
since the (conditional) distribution of xjf' n ^ depends on the state of the urn 
scheme at the start of round k, it is clear that vertex k is more likely to be 
mapped to vertex j if the weight dj(k) is (relatively) large, i.e. if several 
of the vertices 1, 2, k — 1 have already been mapped to vertex j . In the 
following proposition we establish the distribution of T£. 

Theorem 4. Suppose that D^D^-, ... are i.i.d. random variables with a gen- 
eralized negative binomial distribution given by 




X { r ] = j\a(k) = a} 
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where p > is a fixed parameter. 

For n > 1, let D(p, n) = (D P n , D^w ■■■■> ^n,n) ^ e a sequence of variables with 
joint distribution given by 



Pr{D P n = di, 1 < i < n} = Pr { D P = d h 1 < i < n 



n 

i=i 



Then for every n > 1, the random mappings T£ : [n] — > [n] and Tn^ p ' n ^ : 
[n] — > [n] have the same distribution. 

Proof. To prove the result it is enough to show that for any n > 1 and any 

feM n 

Pr{T p = f} = Pr{TfM = /}. 

— * 

Suppose that / G M. n and that d(f) = (d±, d 2 , d n ). It is straightforward 
to check that 



Pr{L>f„ = <U,l<i<n} = PT<D? = di,l<i<n 



-q T(dj + p) ^ n!r(np) 



'" - n 



t\ (^) !r (P) nn + np) 
since the distribution of XT=i * s gi yen by 

)' 

PJ 



It follows from the definition of tF^'^ that 



for A; = 0,1,... . 

(4-1) 



n! (di)ir(p) T(n + np) 



nr=iR+p-i; 



di 



(np + n- 1)„ 

where = m(m — 1) • • • (m — /c + 1) (and (m) = 1). 

Next, observe that if Tfi — f and d(f ) = (di, d 2 , d n ), then for each 
1 < i < n, di 'balls' of weight 1 are added to i th urn during the evolution of 
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the urn scheme described above. So it follows from the definition of T p in 
terms of the urn scheme, that 



nr=i(^+p-ik 



(np + n- l) n 

The result follows since n > 1 and f E M n were arbitrary. □ 

It is clear from Theorem 4 that the order in which a realisation of T p 
is sequentially constructed does not matter. In particular, suppose that 
ii, i^., i n is a permutation of [n] and for 1 < k < n, let T p (ik) = 
where the variables x[ p ' n \ X < ^'' n \ X n p ' n ^ are as defined above. Then it 
follows from the proof of Theorem 4 that T p ~ Tn^ p ' n ^ ~ T p . 

Since T p ~ Tn^ p,n \ we can investigate the structure of G p n = G(T P ) by 

considering the structure of Gn^ p,n \ In this paper, in order to illustrate the 
general method, we derive both exact and asymptotic results for X p , the 
number of cyclic vertices in G?, and C P (n), the size of the component in G p n 
which contains vertex 1. Further results concerning the structure of G p n can 
be found in [24] where we also consider the structure of G p n when p = p(n) 
is a function of n. 

By Theorem 1 and Theorem 4 we have, for 1 < k < n, 

Pr{x p = k} = Pt{X*M = A; j = -L-E^ - l)D p , n D p 2>n ■ ■ ■ B>Q. 

(4.2) 

Since 

E((Di n - l)D P , n D P 2 , n ■ ■ ■ DQ = E((D P - l)D p D p ■ • • D p | j^D p = n) 

i=i 

we have 

E((D p lin -l)D p , n D p 2in •••££„) = 
coefficient of s n in e((D p - l)D p s D *D p s D * ■ ■ ■ D p 
coefficient of s n in E^s D i ■ ■ ■ s D ™j 

coefficient of s n in e([D{ - l)D p s D ^ ^(Dfs^))"" 1 (e(s d i 



S D k S D k + l ■ ■ ■ S D " 



k — l / „ \ n— A; 



coefficient of s n in ^_E(s D i) j 



(4.3) 
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The last equality holds since the variables D^, D^-, are independent and 

identically distributed. Since the identity 



1 " iXfc + g) ,, 

(i_ u )« k \r(a) 



u 

AM (n) 

k=0 



holds for all a > and \u\ < 1, we have 

B (( Df - 1)D f^) = (i±£)^( T _£_)' rt . (4,) 

It follows from (4.2)-(4.4) and routine calculations that for 1 < k < n, 

Pr{ ^ = t} = * Ai+rt- rMn! 



n — fc [n — k — l)!r(np + fc + 1) 

kp\l + P ) ( { l\ (4.5) 
(rap + k) k+1 



and for fc = n, we have 



/? n raT( ra£) ) 

Pr{X£ = n} = Pr{£>J B = 1, 1 < < < n} = r (w + np) ' (4-6) 

Next , we consider the limiting distribution of X p n : Fix < x < oo and 
suppose that k = [x^n\ , then we have 



Pr{X? = k} = kp k (I + p) 



(np + fc) fc+1 



n v p y (i + — ) • • • (i + — ; 



1 + '^exp(-< 1+ /^ * . (47) 



Hence X^/y/n converges in distribution to a variable X p with density 

fx,(*) = (^) ^exp (- (1+ 2 ^ 2 ) for x > 0, (4.8) 
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and hence E(X£) ~ \J 2(i+ P ) ■ denote the number of components in 

G p w then, using standard arguments (see [41]), it follows from Corollary 3 



and Theorem 4 that (Nfi — \ log n)/ y \ log n converges in distribution to the 

standard N(0, 1) distribution. 

We apply Theorem 3 and Theorem 4 to obtain the distribution for Cf (n): 



Pr{Cf (n) = £} = Pr{Ci 



D(p,n) 



= ipr{Bf^}Pr y^D>; n (1.9) 

n I i=i i=i 

for 1 < £ < n. To obtain a local limit theorem for the distribution of Cf(ra), 
fix < a; < oo and suppose that £ = [xn\ . Then it follows from Theorem 4, 
Corollary 2, (4.5), and (4.6) that 



e-i 



Pr{B? = £ £pr{X? ^ = A;} = + p) 



0* 



+ 



k=i 



k=i 

„2 



1 + ^ exp f- |fc 



VI Jo \ P J ~~ r V 2 P / VP 
Also, since the variables D P ,D%, .. are independent, we have 



p n i\T{ip) 

{(p+ //)/,+, ' r(£ + £ P ) 

(4.10) 



1 + P 




i=i 



n 



i=i 



Pr{Er=i A"} 



(4.11) 



T{£{\ + p))r((n - l)(l + p))r(np) 
Y{ip)Y{{n-i)p)Y{n{l+p)) 



n 



1 + pV 2vr£(n - f) 



Substituting (4.10) and (4.11) into (4.9), we obtain 



Pr{Cf(n) 



Pr{Cf(n) = L^nJ} 



2^^ 



(4.12) 
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So, converges in distribution to Z\ as n — > oo, where Z\ has density 

fziiu) = 0(1 — u) 9 ^ 1 on the interval (0,1) with 9 = 1/2. It follows from 
Theorem 3 and similar calculations that for any integer t > 1 and constants 

< Oj < bi < 1, where 1 < i < t, 

di < = < 6j, 1 < i < t\ 

n — (J D (P' n ) _ . . . _ Q D (P< n ) | 

= ny ai 2vr^ dx ' 

Hence, it follows from standard arguments (see, for example [22]) that: 

Theorem 5. The joint distribution of the normalized order statistics for the 
component sizes in G p n converges to the Poisson-Dirichlet (1/2) distribution 
on the simplex V = {{xi} J^Xi < l,Xi > x,i + \ > for every i > 1}. 

An Anti-Preferential Attachment Model 

In this section we define T™ : [n] — > [n], a random mapping with 'anti- 
preferential attachment', where m > 1 is a fixed interger parameter. For 

1 < k < n, we define T™(k) = Y^ n) where, as in the definition of T^, the 
variables Y^ m ' n \ y 2 ( - m ' n ' ) , .., Yn m ' n ^ depend on the evolution of an urn scheme. 
The distribution of each variable Y^ m ' n ^ is determined by a (random) n-tuple 
of non-negative weights b(k) = (bi(k), b2(k), b n (k)) where, for 1 < j ' < n, 
bj(k) is the number of balls in the j th urn at the start of the k th round of the 
urn scheme. Specifically, given b{k) — b — (bi, b n ), we define 

Pr{y^=i|6(fc)=6} = s ^_. 

The random weight vectors 6(1), 6(2), 6(n) associated with the urn scheme 
are determined recursively. For k = 1, we set 6i(l) = 6 2 (1) = ••• = 6 n (l) = m. 
For k > 1, b(k) depends on both b(k — 1) and the value of Y^™f^ as follows: 
Given that Y^ n ^ = j, we set bj(k) = bj(k — 1) — 1 and for all other i ^ j, 

we set bi(k) = bi(k — 1) (i.e. if = j then a ball is removed from the j th 

urn). 
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The random mapping T™ as defined above is an anti-preferential at- 
tachment model in the following sense. Since, for 1 < k < n, we have 
T™(k) = Y^ m ' n \ and since the (conditional) distribution of Y^ m,n ^ depends 
on the state of the urn scheme at the start of round k, it is clear that vertex 
k is less likely to ' choose' vertex j if the weight bj(k) is (relatively) small, i.e. 
if several of the vertices 1, 2, k — 1 have already been mapped to vertex j. 
It is also clear from the definition of T™ that the in-degree of any vertex in 
the random digraph G™ = G{T™) is at most m and in the case m — 1, is 
a (uniform) random permutation. In the following proposition we determine 
the distribution of T™. 



- n 



Theorem 6. Suppose that D™, D™, ... are i.i.d. Bin(m, ^) variables where 
m > 1 is a fixed integer parameter. 

Let Dim, n) = (D™ n , D™ w ...,D™ n ) be a sequence of variables with joint dis- 
tribution given by 

Pr0™ = di, 1 < i < n} = Pr J D? = d t , 1 < i < n £ D? = 

{ i=i 

Then the random mappings T™ and T n D(m,n) have the same distribution. 
Proof. To prove the result it is enough to show that for any n > 1 and any 

feM n 

Pr{T™ = /} = Pr{T°^ = /}. 

Suppose that / G M. n and that d(f) = (d±, d2, d n ). It is straightforward 
to check that 



Pi{D™ n =di,l<i<n} = Pi jA™ = d h l<i<n 

niii © 



n 

i=i 



(T) 

and hence, from the definition of Tn^ m ' n \ that 



p r r T D(m,n) _ f \ _ " " " v n,;=i (dj) 
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= nr=i( m k 

(nm) n 

— # 

On the other hand, T™ = f with d(f) = (di, d%, d n ) if and only if for 
each 1 < i < n, di balls are removed, in a certain order, from the i th urn 
during the evolution of the urn model described above. So it follows from 
the definition of T™ in terms of the urn scheme, that 

pr{T T r = /> = n r i( T k 

(nm) n 

The result follows since n > 1 and / G M n were arbitrary. □ 

Again, it is clear from Theorem 6 that the order in which a realisation of 
T™ is sequentially constructed does not matter. In particular, suppose that 
ii,i 2 ,...,i n is a permutation of [n] and for 1 < k < n, let T™(ik) = Yn 171 '^ 
where the variables Y^ m ' n \ F 2 < - m ' n \ Yn™'^ are as defined above. Then it 
follows from the proof of Theorem 6 that T™ ~ x^ 3 *" 1 '™) ?L t™. 

We apply Theorem 6 to investigate the distributions of the number of 
cyclic vertices in G™ = G(T™) and the size of a typical component in G(T™). 
Let X™ denote the number of cyclic vertices in the random digraph G™ = 
G{T™) and let C™(n) denote the size of the component in G™ which contains 

the vertex 1. Since T™ I T° (m ' n) , we have G(T n m ) ~ G? ( m ' n) and X? ~ 
-^■D{m,n) ^ ^ f jj owg f rom Theorem 1 and Theorem 6 (and its proof) that 
for m > 2 and 1 < /c < n, we have 

PrW = = Pr{xf M = A;} = ^E((D™ n - l)D? n D?, n ■ ■ ■ D k , n ) 



1 ( m \ I m\ 

Y, (di " 1)^1* • • • 4 x UJ ' ' ' UJ 



n-ife ^ v ' ( nrn ) 

d s.t. E?=i*=" 

min(n,fcm) /rrA /rrA 

E E 

i=fc+l d - a . t . E J =idj=t I n ) 

and d^=n 
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min(n,fcm) ( m \ ( m \ ( nm ~km\ 

K / _, x , , , . . \di) ' ' ' \d k ) \ n-t ) 



n-fc ^ ^ v ' ( nm ) 

min(n,fcm) 



min(n,fcm) /m— 2\ (m— 1\ /m— 1\ lnm—km\ 

m (m 



n-k v 7 ^ ^ ( nm ) 

min(n,fem) /fcm-fc-l\ (nm—km\ 

m k (m-l) V ' t ~ fc ~ 1 ^ n ~ f j 

t=k+l v " / 

, (nm— k— 1\ 

k k 1 1 x V n-k-1 ) 

-m (m — 1)- 



n-A; (T) ' 

In the summations above the sum is always taken over those degree sequences 
for which the binomial coefficients are defined. We also adopt the formal 
convention that (°) = 1. Finally, for k = n and m > 2, we obtain 

Pr{X n m = n} = Pr{A™ = 1, 1 < < < n} 



(T) ' 



and when m — 1, we have X* = ra. 

To obtain a local limit theorem for X™, fix < x < 00 and suppose that 
A; = L^V^-J • Then we have 

, (nm— k— 1\ 

^{K = k} = ^m k (m - 1)^=*=±L 
- k -m\m-l)^ 



n — k (nm) 



k+l 



m — 1\ / — (m — l)x 2 \ 1 



m J \ 2m J y/n 

It follows that X^/^Jn converges in distribution to a variable X m with den- 
sity 

/x fm-l\ f-(m-l)x 2 \ 

fx \ x ) — \ rrexp — — for x > 0, 

JXmK J I m P \ 2m J 



and hence E(X™) ~ \J 2(m-i) • denote the number of components in 

G™. Then, by the same argument as used for N? above, it follows that 
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(N™ — | log n) I ^\ log n converges in distribution to a standard iV(0, 1) dis- 
tribution. 

We apply Theorem 3 and Theorem 6 to obtain the distribution for C™(n): 
Pr{CrH = = Pr{Cf {rn ' n) = £} 



i 



= ^Pr{Bf (m/) }Pr I D ? H D ^ = n \ ( 4 - 13 ) 



i=i 



i=i 

for 1 < £ < n. To obtain a local limit theorem for the distribution of C™(n), 
fix < x < oo and suppose that £ = [xn\ . Then it follows from Theorem 6 
and Corollary 2, that 

Pr{Bf (m/) } = £ ipr{Xf {m/) = A;} 
fe=i 

€— 1 u , tira—k — \\ « 

E m ft (m - 1) ( t _ k _ x ) lm' 
P _ U (tm\ p ' 



m - 1\ ( (m- l)x 2 \ m , , , 

ex P ~ dx = • ( 4 - 14 ) 




VI Jo V m / V 2m y \ m 

Since D™,D™, ... are i.i.d. Bin(m, — ) variables, we also have 

^ ( n /m£\ /mn—mi\ , 



1=1 1=1 



(7) V27rf(m-l)(n-f)- 

(4.15) 

Substituting (4.14) and (4.15) into (4.13) we obtain 

Pr{C7(n) =£} r as n -> oo. (4.16) 

2nVl — x 

It follows that as n — > oo, 1 ^ converges in distribution to Zi, where Z\ has 
fz^u) = 9(1 — u) 8 ^ 1 on the interval (0, 1) with parameter 9 = 1/2. Again, as 
in the case of the preferential attachment model, it is not difficult to extend 
(4.16) in order to show that: 

Theorem 7. The joint distribution of the normalized order statistics for the 
component sizes in G™ converges to the Poisson-Dirichlet (1/2) distribution 
on the simplex V = {{xi} '■ ^,Xi < l,Xi > x i+ i > for every i > 1}. 
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5 Final Remarks 



In this paper we have introduced a new random mapping model, T„ , which is 
defined in terms of a collection of exchangeable, non-negative, integer-valued 
'in-degree' variables D 1: D 2 , D n such that Y^=i A = n - We note the clas- 
sic model T p ( ra ) has the property that each vertex independently chooses its 

image under T p / n y In we have replaced this independence with exchange- 
ability of the in-degrees in (and in the examples with the 'independence' 
of random variables which are used to define the in-degrees). 

We have shown that the joint distribution of the variables Di, D 2) D n is 
the key to understanding both the local and global structure of and that 
the distributions of several variables associated with the structure of can 
be expressed in terms of expectations of various functions of D\, D 2 , D n . In 
the special case where the variables Di, D 2 , D n have the same distribution 
as a collection of i.i.d. variables Di, D 2 , D n conditioned on ^2™ =1 Di = 
n, we have shown that it is easy to apply our results to obtain exact and 
asymptotic distributions for the number of cyclic vertices, the number of 
components, and the size of a typical component in G%. Both T p(n ) and Zjf 
share the property that these mappings restricted to the cyclical vertices are 
both uniform random permutations on the cyclical vertices. This means that 
given the distribution of the number of cyclic vertices, X® ', (see Theorem 1) 
we can exploit known results for uniform random permutations to obtain 
results for T£, and it explains some similarities between results for both 
T p(n) and . 

The transparent relationship between the distribution of the variables 
Di, D 2 , D n and the structure of random mapping T® is the main advan- 
tage of this new model. It also provides the basis for testing and fitting ran- 
dom mapping models in various applications. For example, if D 1 , D 2 , D n 
have the same distribution as n independent Poisson(l) variables, Di,D 2 , .. 
..,D n , conditioned on Y^=i-Di = n, then is a uniform random mapping 
and, for large n and 1 < % < n, the marginal distribution of each L>i is ap- 
proximately Poisson{l). So vertex in-degree data, such as directed epidemic 
contacts, can be used to test whether the associated mapping is a realisation 
of a uniform random mapping model. 

As another example, we mention the work of Arney and Bender on ran- 
dom mappings with constraints on coalescence [4]. Their work was motivated, 
in part, by the analysis of shift register data. In order to model a random 
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shift register they put a uniform measure on M.\ ' ' , the set of all mappings 
/ : [n] — > [n] such that, for every 1 < i < n, the number of pre- 

images of i under /, equals 0, 1, or 2. So, iff G 7Wi 0,1 ' 2} , then every vertex in 
G n (f) has in-degree equal 0, 1, or 2. Arney and Bender observed that in some 
respects their model does not fit the shift register data. In particular, their 
model predicts 0.293n vertices with in-degree whereas the average number 
of vertices with in-degree in a random shift register is n/4. By using the 
model instead, we can more successfully capture the local structure of the 
shift register data. Specifically, suppose that Di, D 2 , D n have the same 
distribution as n independent Bin(2, |) variables, D±, D 2 , D n , conditioned 
on Y^i=i Di = n. Then Pr{Di = 0} = \{1 + — Z2 )~ X anc ^ ^ ne ex P ec ted num- 
ber of vertices with in-degree in is asymptotic to \. We note that, for 
example, the asymptotic distribution of the normalised typical component 
size is the same for both the Arney and Bender model and . So it is 
not surprising that Arney and Bender found that their model fit some other 
features of the shift register data quite well. 

Finally, we mention that in applications of random mapping models in 
cryptology and in epidemic process modelling the distributions of the number 
of predecessors and of the number of successors of an arbitrary vertex or set 
of vertices are also of interest. We develop a calculus for computing these 
distributions based on the underlying variables D±, D 2 , D n in a companion 
paper [23]. 
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